Raluca Ada Popa

Hidden Thoughts Are Not Secret: Reasoning Trace Exposure in LLMs

May 30, 2026

Web Agents Should Adopt the Plan-Then-Execute Paradigm

May 14, 2026

GradShield: Alignment Preserving Finetuning

May 13, 2026

Onyx: Cost-Efficient Disk-Oblivious ANN Search

Apr 22, 2026

Opal: Private Memory for Personal AI

Apr 02, 2026

MiniScope: A Least Privilege Framework for Authorizing Tool Calling Agents

Dec 11, 2025

Auditing Black-Box LLM APIs with a Rank-Based Uniformity Test

Jun 08, 2025

An Approach to Technical AGI Safety and Security

Apr 02, 2025

A Framework for Evaluating Emerging Cyberattack Capabilities of AI

Mar 14, 2025

JudgeBench: A Benchmark for Evaluating LLM-based Judges

Oct 16, 2024